Search results for "Minimum bounding box"

showing 10 items of 10 documents

Optimized spatio-temporal descriptors for real-time fall detection: comparison of support vector machine and Adaboost-based classification

2013

We propose a supervised approach to detect falls in a home environment using an optimized descriptor adapted to real-time tasks. We introduce a realistic dataset of 222 videos, a new metric allowing evaluation of fall detection performance in a video stream, and an automatically optimized set of spatio-temporal descriptors which fed a supervised classifier. We build the initial spatio-temporal descriptor named STHF using several combinations of transformations of geometrical features (height and width of human body bounding box, the user’s trajectory with her/his orientation, projection histograms, and moments of orders 0, 1, and 2). We study the combinations of usual transformations of the…

Contextual image classificationComputer sciencebusiness.industryFeature extractionWavelet transformFeature selectionPattern recognitionAtomic and Molecular Physics and OpticsComputer Science ApplicationsSupport vector machineMinimum bounding boxRobustness (computer science)Computer visionAdaBoostArtificial intelligenceElectrical and Electronic EngineeringbusinessJournal of Electronic Imaging
researchProduct

On the use of Deep Reinforcement Learning for Visual Tracking: a Survey

2021

This paper aims at highlighting cutting-edge research results in the field of visual tracking by deep reinforcement learning. Deep reinforcement learning (DRL) is an emerging area combining recent progress in deep and reinforcement learning. It is showing interesting results in the computer vision field and, recently, it has been applied to the visual tracking problem yielding to the rapid development of novel tracking strategies. After providing an introduction to reinforcement learning, this paper compares recent visual tracking approaches based on deep reinforcement learning. Analysis of the state-of-the-art suggests that reinforcement learning allows modeling varying parts of the tracki…

General Computer ScienceComputer scienceFeature extractionMachine learningcomputer.software_genreField (computer science)video-surveillanceMinimum bounding boxReinforcement learningGeneral Materials ScienceSettore ING-INF/05 - Sistemi Di Elaborazione Delle Informazionideep reinforcement learningComputer vision machine learning video-surveillance deep reinforcement learning visual tracking.business.industryGeneral EngineeringTracking systemvisual trackingVisualizationActive appearance modelTK1-9971machine learningEye trackingComputer visionArtificial intelligenceElectrical engineering. Electronics. Nuclear engineeringbusinesscomputer
researchProduct

A Semantic Web Approach for Geodata Discovery

2013

International audience; Currently, vast amounts of geospatial information are o ffered through OGC's services. However this information has limited formal semantics. The most common method to search for a dataset consists in matching keywords to metadata elements. By adding semantics to available descriptions we could use modern inference and reasoning mechanisms currently available in the SemanticWeb. In this paper we present a novel architecture currently in development in which we use state of the art triplestores as the backend of a CSW service. In our approach, each metadata record is considered an instance of a given class in a domain ontology. Our architecture also adds a spatial dat…

Geospatial analysisComputer scienceFormal semantics (linguistics)SemanticWeb0211 other engineering and technologiesInference02 engineering and technologyOntology (information science)computer.software_genreWorld Wide Web[ INFO.INFO-HC ] Computer Science [cs]/Human-Computer Interaction [cs.HC]Minimum bounding box0202 electrical engineering electronic engineering information engineeringontologies[INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]ArchitectureSemantic Webcatalogues021101 geological & geomatics engineeringInformation retrievalMetadataOGC servicesgeodata discovery020201 artificial intelligence & image processing[INFO.INFO-HC] Computer Science [cs]/Human-Computer Interaction [cs.HC]computer
researchProduct

Computer-aided detection of cerebral microbleeds in susceptibility-weighted imaging.

2014

Susceptibility-weighted imaging (SWI) is recognized as the preferred MRI technique for visualizing cerebral vasculature and related pathologies such as cerebral microbleeds (CMBs). Manual identification of CMBs is time-consuming, has limited reliability and reproducibility, and is prone to misinterpretation. In this paper, a novel computer-aided microbleed detection technique based on machine learning is presented: First, spherical-like objects (potential CMB candidates) with their corresponding bounding boxes were detected using a novel multi-scale Laplacian of Gaussian technique. A set of robust 3-dimensional Radon- and Hessian-based shape descriptors within each bounding box were then ex…

Hessian matrixComputer sciencePosterior probabilityHealth InformaticsBlob detectionSensitivity and SpecificityPattern Recognition AutomatedMachine Learningsymbols.namesakeMinimum bounding boxBounding overwatchImage Interpretation Computer-AssistedHumansRadiology Nuclear Medicine and imagingComputer visionComputer SimulationReliability (statistics)Cerebral HemorrhageObserver VariationModels StatisticalRadiological and Ultrasound TechnologyRadon transformbusiness.industryReproducibility of ResultsPattern recognitionImage EnhancementComputer Graphics and Computer-Aided DesignRandom forestDiffusion Magnetic Resonance ImagingData Interpretation StatisticalsymbolsComputer Vision and Pattern RecognitionArtificial intelligencebusinessAlgorithmsMagnetic Resonance AngiographyComputerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society
researchProduct

Autonomous Mooring towards Autonomous Maritime Navigation and Offshore Operations

2020

Bollard is a vital component of mooring system. It is the anchor point for mooring ropes to be fixed in order to secure the vessel or ship. An algorithm that translates the segmented mask of bollard output from masked R-CNN along with bounding box and associated class probability to its corresponding edge coordinate and finally to the single reference point for efficient detection and classification of bollard towards autonomous mooring is presented. At first stage, Mask R-CNN framework is trained with custom built bollard. The model obtained from the training is inferred with real data resulting in instance segment of bollard. The segmented mask obtained contains relatively large amount of…

Point of interestbusiness.industryComputer science020209 energy02 engineering and technologyMooringMinimum bounding box0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingPoint (geometry)Submarine pipelineComputer visionArtificial intelligenceEnhanced Data Rates for GSM EvolutionbusinessReduction (mathematics)Pose2020 15th IEEE Conference on Industrial Electronics and Applications (ICIEA)
researchProduct

EFFICIENT MACHINE LEARNING FRAMEWORK FOR COMPUTER-AIDED DETECTION OF CEREBRAL MICROBLEEDS USING THE RADON TRANSFORM

2014

International audience; Recent developments of susceptibility weighted MR techniques have improved visualization of venous vasculature and underlying pathologies such as cerebral microbleed (CMB). CMBs are small round hypointense lesions on MRI images that are emerging as a potential biomarker for cerebrovascular disease. CMB manual rating has limited reliability, is time-consuming and is prone to errors as small CMBs can be easily missed or mistaken for venous crosssections. This paper presents a computer-aided detection technique that utilizes a novel cascade of random forest classifiers which are trained on robust Radon-based features with an unbalanced sample distribution. The training …

Radon transformbusiness.industryBlob detection030218 nuclear medicine & medical imagingVisualizationRandom forest03 medical and health sciences0302 clinical medicineSampling distribution[INFO.INFO-TI] Computer Science [cs]/Image Processing [eess.IV]Minimum bounding box[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]Susceptibility weighted imaging[ INFO.INFO-TI ] Computer Science [cs]/Image ProcessingComputer visionArtificial intelligenceSensitivity (control systems)business030217 neurology & neurosurgeryMathematics
researchProduct

Definition and Performance Evaluation of a Robust SVM Based Fall Detection Solution

2012

We propose an automatic approach to detect falls in home environment. A Support Vector Machine based classifier is fed by a set of selected features extracted from human body silhouette tracking. The classifier is followed by filtering operations taking into account the temporal nature of a video. The features are based on height and width of human body bounding box, the user's trajectory with her/his orientation, Projection Histograms and moments of order 0, 1 and 2. We study several combinations of usual transformations of the features (Fourier Transform, Wavelet transform, first and second derivatives), and we show experimentally that it is possible to achieve high performance using a si…

Support vector machineDiscrete wavelet transformWaveletMinimum bounding boxComputer sciencebusiness.industryRobustness (computer science)Margin classifierFeature extractionWavelet transformPattern recognitionArtificial intelligencebusiness2012 Eighth International Conference on Signal Image Technology and Internet Based Systems
researchProduct

Alternative method for binary shape alignment of non-symmetrical shapes based on minimal enclosing box

2012

Proposed is a novel method based on the minimal enclosing box (MEB) to determine the canonical orientation associated with a three-dimensional binary shape. It is suggested that, when the shape has no clear distinctive features and two or more of the eigenvalues are similar, this method is more suitable than the commonly used method based on principal component analysis (PCA). An experiment is performed with shapes of human livers by measuring the degree on which a prototypical image (atlas) matches to a new shape after alignment by PCA, minimal area projection (MAP), and MEB showing that in this case MEB outperforms the usual PCA-based alignment method and also the MAP method.

business.industryAtlas (topology)Binary numberPattern recognitionComputer Science::Computational GeometryTopologyComputational geometryImage (mathematics)Minimum bounding boxPrincipal component analysisArtificial intelligenceElectrical and Electronic EngineeringbusinessProjection (set theory)Eigenvalues and eigenvectorsMathematicsElectronics Letters
researchProduct

Head Pose Estimation for Sign Language Video

2013

We address the problem of estimating three head pose angles in sign language video using the Pointing04 data set as training data. The proposed model employs facial landmark points and Support Vector Regression learned from the training set to identify yaw and pitch angles independently. A simple geometric approach is used for the roll angle. As a novel development, we propose to use the detected skin tone areas within the face bounding box as additional features for head pose estimation. The accuracy level of the estimators we obtain compares favorably with published results on the same data, but the smaller number of pose angles in our setup may explain some of the observed advantage.

business.industryComputer scienceComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONSign language3D pose estimationMotion captureData setSupport vector machineMinimum bounding boxFace (geometry)Computer visionArtificial intelligencebusinessPose
researchProduct

Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation

2021

International audience; Deep learning-based image understanding techniques require a large number of labeled images for training. Few-shot semantic segmentation, on the contrary, aims at generalizing the segmentation ability of the model to new categories given only a few labeled samples. To tackle this problem, we propose a novel prototypical network (MAPnet) with multiscale feature attention. To fully exploit the representative features of target classes, we firstly extract rich contextual information of labeled support images via a multiscale feature enhancement module. The learned prototypes from support features provide further semantic guidance on the query image. Then we adaptively i…

business.industryComputer scienceDeep learningFeature extractionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Pattern recognition02 engineering and technologyImage segmentation010501 environmental sciencesSemantics01 natural sciencesImage (mathematics)[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Minimum bounding boxFeature (computer vision)0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingSegmentationArtificial intelligencebusiness0105 earth and related environmental sciences
researchProduct